CELLOBIOHYDROLASE REDUCED GLYCOSYLATION VARIANTS: CBHIN45A, CBHIN270A, AND CBHIN384A

Technical Field 

This invention relates to exoglucanases. More specifically, it relates to Trichoderma 
reesei cellobiohydrolase I reduced glycosylation variants which enable expression of the active 
enzyme in a heterologous host. 

Background Art. 

The surface chemistry of acid-pretreated biomass, used in ethanol production, is different
from that found in plant tissues naturally digested by fungal cellulase enzymes in two important
ways: (1) pretreatment heats the substrate past the phase-transition temperature of lignin; and (2)
pretreated biomass contains less acetylated hemicellulose. Thus, it is believed that the cellulose
fibers of pretreated biomass are coated with displaced and modified lignin. This alteration results
in non-specific binding of the protein to the biomass, which impedes enzymatic activity.
Moreover, where the pretreated biomass is a hardwood pulp, it contains a weak net-negatively
charged surface, which is not observed in native wood. Therefore, for the efficient production of
ethanol from pretreated biomass, it is desirable to enhance the catalytic activity of glycosyl
hydrolases on acid-hydrolyzed hardwoods.

Trichoderma reesei CBH I is a mesophilic cellulase enzyme, and comprises a major
catalyst in the overall hydrolysis of cellulose. An artificial ternary cellulase system consisting of a
90:10:2 mixture of T. reesei CBH I, A. cellulolyticus EI, and A. niger β-D-glucosidase is capable
of releasing as much reducing sugar from pretreated yellow poplar as the native T. reesei system
after 120 h. This result is encouraging for the ultimate success of engineered cellulase systems,
because this artificial enzyme system was tested at 50°C, a temperature far below that considered
optimal for EI, in order to spare the more heat-labile enzymes CBH I and β-D-glucosidase. In
order to increase the efficiency of such artificial enzyme systems, it is desirable to engineer new T.
reesei CBH I variant enzymes capable of active expression in a heterologous host. The
heterologous host Aspergillus awamori could provide an excellent capacity for synthesis and
secretion of T. reesei CBH I because of its ability to correctly fold and post-translationally modify
proteins of eukaryotic origin. Moreover, A. awamori is believed to be an excellent test-bed for
Trichoderma coding sequences and resolves some of the problems associated with direct
site-directed mutagenesis in Trichoderma.

In consideration of the foregoing, it is therefore desirable to provide variant cellulase
enzymes having enzymatic activity when expressed in a heterologous host.



Disclosure of Invention 

It is a general object of the present invention to provide variant cellulase enzymes having
enzymatic activity when expressed in a heterologous host, such as a filamentous fungus or yeast.

Another object of the invention is to provide a variant exoglucanase characterized by a 
reduction in glycosylation when expressed in a heterologous host. 

Another object of the invention is to provide an active cellobiohydrolase enzyme capable
of expression in heterologous fungi or yeast.

It is yet another object of the invention to provide a method for reducing the glycosylation
of a cellobiohydrolase enzyme for expression in a heterologous host.

The foregoing specific objects and advantages of the invention are illustrative of those
which can be achieved by the present invention and are not intended to be exhaustive or limiting
of the possible advantages which can be realized. Thus, those and other objects and advantages
of the invention will be apparent from the description herein or can be learned from practicing the
invention, both as embodied herein or as modified in view of any variations which may be
apparent to those skilled in the art.

Briefly, the invention provides a method for making an active exoglucanase in a
heterologous host, the method comprising reducing glycosylation of the exoglucanase, reducing
glycosylation further comprising replacing an N-glycosylation site amino acid residue with a
non-glycosyl accepting amino acid residue. The invention further provides a cellobiohydrolase,
comprising the reduced glycosylation variant cellobiose enzymes CBHIN45A; CBHIN270A; or
CBHIN384A, or any combination thereof.

Best Mode for Carrying out the Invention.

Unless specifically defined otherwise, all technical or scientific terms used herein have the
same meaning as commonly understood by one of ordinary skill in the art to which this invention
belongs. Although any methods and materials similar or equivalent to those described herein can
be used in the practice or testing of the present invention, the preferred methods and materials are
now described.

A method for reducing the glycosylation of an expressed Trichoderma reesei CBH I
protein by site-directed mutagenesis ("SDM") is disclosed. The method includes replacing an
N-glycosylation site amino acid residue, such as asparagines 45, 270, and/or 384 of SEQ. ID NO: 4
(referenced herein as CBHIN45A, CBHIN270A and CBHIN384A, respectively), with a
non-glycosyl accepting amino acid residue, such as alanine. Various mutagenesis kits for SDM are
available to those skilled in the art and the methods for SDM are well known. The description
below discloses a procedure for making and using CBH I variants: CBHIN45A, CBHIN270A, and
CBHIN384A. The examples below demonstrate the expression of active CBH I in the
heterologous fungus Aspergillus awamori.
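The three targeted asparagines can be checked against the commonly used N-glycosylation sequon pattern (Asn-X-Ser/Thr, where X is not proline). The following is a minimal sketch under stated assumptions: the short windows are hand-transcribed from SEQ ID NO: 4 into one-letter code, the function names are hypothetical, and the sequon rule is assumed to be the relevant criterion, which the text does not state explicitly.

```python
import re

# Windows transcribed by hand from SEQ ID NO: 4 (one-letter code, an assumption).
# Each entry maps a target site to (position of the window's first residue, window).
WINDOWS = {
    45: (41, "THATNSSTNC"),
    270: (265, "PYRLGNTSFYG"),
    384: (378, "DSTYPTNETSSTP"),
}

# N-X-S/T sequon with X != Pro; the lookahead allows overlapping sites to be found.
SEQUON = re.compile(r"N(?=[^P][ST])")


def sequon_positions(window, start):
    """Return protein positions of Asn residues that sit in a sequon."""
    return [start + m.start() for m in SEQUON.finditer(window)]


def substitute(window, start, position, new_residue="A"):
    """Replace the Asn at `position` with `new_residue` (alanine by default)."""
    i = position - start
    if window[i] != "N":
        raise ValueError(f"residue {position} is {window[i]}, not Asn")
    return window[:i] + new_residue + window[i + 1:]


for site, (start, window) in WINDOWS.items():
    mutant = substitute(window, start, site)
    print(site, sequon_positions(window, start), "->", sequon_positions(mutant, start))
```

In each window the Asn-to-Ala substitution removes the only sequon found, which is consistent with the replacement strategy described above.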

Industrial Applicability 

Site-Directed Mutagenesis of Trichoderma reesei CBH I for Reduced Glycosylation.

Aspergillus awamori was transformed with various versions of the cbh1 gene from
Trichoderma reesei. The cbh1 genes included both cDNA and genomic (intron-containing)
versions. These were altered by site-directed mutagenesis for the specific purpose of reducing the
glycosylation of the expressed CBH I protein through replacement of the N-glycosylation site
amino acid residues (asparagine) with non-glycosyl accepting amino acid residues (alanine). The
gene was propagated in an E. coli vector plasmid (pPFE2) under the control of the Aspergillus
awamori glucoamylase promoter and signal sequence, and trpC terminator, and carrying
resistance to ampicillin (E. coli selection) and Zeocin (bleomycin; Aspergillus selection). One
altered rCBH I variant, CBHIN270A, SEQ. ID NO: 2, was isolated from cultures and
determined to be consistent with native CBH I, SEQ. ID. NO: 4, with respect to kinetics on
pNPL and was only slightly higher in molecular weight. Thus, construction of the triple reduced
glycosylation mutant CBH I, CBHIN270A (SEQ. ID. NO: 2) / CBHIN45A (SEQ. ID. NO: 1) /
CBHIN384A (SEQ. ID. NO: 3), may provide a viable means of producing active CBH I in
heterologous fungi or yeast, which do not require the cellobiose/lactose induction cascade
known in Trichoderma. It is believed that reduced glycosylation CBH I mutants would also serve
effectively in yeast-based high-throughput screens, which are normally rendered unusable for
fungal enzymes because of hyperglycosylation.

Example 1. Production of Active Recombinant CBH I (rCBH I) in Aspergillus awamori
Construction of Modified CBH I Coding Sequence.

The coding sequence for T. reesei CBH I (SEQ. ID. NO: 4) was successfully inserted and
expressed in Aspergillus awamori using the fungal expression vector pPFE2 (and pPFE1).
Vectors pPFE1 and pPFE2 are E. coli-Aspergillus shuttle vectors, and contain elements required
for maintenance in both hosts. They encode ampicillin resistance for selection in E. coli and
Zeocin resistance for selection in Aspergillus. The foregoing provided for the site-directed
mutagenesis in E. coli, followed by expression of the new mutant proteins in A. awamori. The
CBH I gene is under the control of the A. awamori glucoamylase promoter and includes the
glucoamylase secretion signal peptide. In order to have the signal peptide properly cleaved during
secretion, the construction of this plasmid required the addition, by PCR, of a NotI site and XbaI
site on the coding sequence of CBH I. The NotI site addition resulted in a change of the most
N-terminal amino acid on the protein from glutamine to glycine. This glycine was subsequently
changed back to the native glutamine in the pPFE2/CBHI construct, using site-directed
mutagenesis PCR. This new construct was used to transform A. awamori and to express rCBH I,
as confirmed by western blot analysis of culture supernatant. The rCBH I expressed in A.
awamori tends to be over-glycosylated, as evidenced by the higher molecular weight observed on
western blot analysis. Over-glycosylation of CBH I by A. awamori was confirmed by digestion of
the recombinant protein with endoglycosidases. Following endoglycosidase H and F digestion,
the higher molecular weight form of the protein collapses to a molecular weight similar to native
CBH I.

The vector pPFE2/CBHI requires a relatively long PCR reaction (8.2 kb) to make
site-specific changes using the Stratagene QuikChange protocol. The PCR reaction was optimized as
follows using a GeneAmp PCR System 2400, Perkin Elmer Corporation. The reaction mixture
contained 50 ng of template DNA; 125 ng each of the sense and antisense mutagenic primers; 5 µL
of Stratagene 10x cloned Pfu buffer; 200 mM of each dNTP; 5 mM MgCl2 (total final
concentration of MgCl2 is 7 mM); and 2.5 U Pfu Turbo DNA polymerase. The PCR reaction
was carried out for 30 cycles, each consisting of 1 minute of denaturation at 96°C, 1 minute of
annealing at 69°C, and a 20-minute extension at 75°C. There was an initial denaturation for 2 minutes
at 96°C and a final extension for 10 minutes at 75°C, followed by a hold at 4°C. Agarose gel
electrophoresis, ethidium bromide staining, and visualization under UV transillumination were
used to confirm the presence of a PCR product.
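The cycling program above implies a long run, and the stated magnesium figures imply a contribution from the buffer. The following sketch only does that arithmetic. It assumes the 10x buffer is used at 1x (so 5 µL of buffer implies a 50 µL reaction, a volume the text does not state), and all variable names are illustrative.

```python
# Cycling parameters as given in the text (all times in minutes).
INITIAL_DENATURATION = 2
CYCLES = 30
DENATURATION, ANNEALING, EXTENSION = 1, 1, 20
FINAL_EXTENSION = 10

# Programmed time, excluding ramping between temperatures and the 4 C hold.
total_min = (INITIAL_DENATURATION
             + CYCLES * (DENATURATION + ANNEALING + EXTENSION)
             + FINAL_EXTENSION)
print(f"programmed time: {total_min} min ({total_min / 60:.1f} h)")

# Assumption: the 10x buffer is diluted to 1x in the final reaction.
buffer_ul, buffer_fold = 5, 10
reaction_ul = buffer_ul * buffer_fold
print(f"implied reaction volume: {reaction_ul} uL")

# The text gives 5 mM added magnesium and 7 mM total; the rest comes from the buffer.
added_mg_mm, total_mg_mm = 5, 7
buffer_mg_mm = total_mg_mm - added_mg_mm
print(f"magnesium contributed by buffer: {buffer_mg_mm} mM")

# Extension time available per kb of the 8.2 kb plasmid.
min_per_kb = EXTENSION / 8.2
print(f"extension allowance: {min_per_kb:.2f} min/kb")
```

The programmed time of roughly 11 hours is why the text calls this a relatively long PCR reaction.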

PCR products were digested with restriction enzyme DpnI, to degrade un-mutagenized
parental DNA, and transformed into E. coli (Stratagene Epicurian Coli Supercompetent XL-1
Cells). Ampicillin-resistant colonies were picked from LB-ampicillin plates and mutations were
confirmed by DNA sequencing. Depending on scale, plasmid DNA was purified using the Qiagen
QiaPrep Spin Miniprep Kit or the Promega Wizard Plus MaxiPrep DNA Purification System.

Transformation of Aspergillus awamori with Trichoderma reesei CBH I coding sequence.

Aspergillus awamori spore stocks were stored at -70°C in 20% glycerol, 10% lactose.
After thawing, 200 µL of spores were inoculated into 50 mL CM broth in each of eight baffled
250 mL Erlenmeyer flasks. The cultures were grown at 28°C, 225 rpm for 48 h. The mycelia
were removed by filtration with sterile Miracloth (Calbiochem, San Diego, CA) and washed
thoroughly with sterile KCM. Approximately 10 g of washed mycelia were transferred to 50 mL
KCM + 250 mg Novozym 234 in a 250 mL baffled Erlenmeyer flask. The digestion mixture was
incubated at 30°C, 80 rpm for 16-18 h. Spheroplasts were filtered through Miracloth into 50 mL
conical centrifuge tubes, pelleted at 2000 x g for 15 min and resuspended in 0.7 M KCl by gentle
trituration with a 25 mL pipette. This procedure was repeated once. After a third pelleting, the
spheroplasts were resuspended in 10 mL KC, pelleted and resuspended in 0.5 mL KC using a
wide-bore pipet tip. The washed spheroplasts were transformed by adding 12.5 µL PCM and 5
µL DNA (approximately 0.5 µg/µL) to 50 µL of spheroplasts in sterile 1.5 mL Eppendorf tubes. After
incubation on ice for 45 minutes, 0.5 mL of room-temperature PCM was added to the
transformation mixture and was mixed by trituration with a wide-bore pipet tip. The mixture was
incubated at room temperature for 45 minutes. One milliliter of KC was added and mixed. The
mixture was allocated between four tubes of molten CM top agar at 55°C, which were each
poured over a 15 mL CM170 plate. The plates were incubated at 28°C for 2-3 days. Subsurface
colonies were partially picked with a sterile wide-bore pipet tip, exposing the remaining part of the
colony to air and promoting rapid sporulation. After sporulation, spores were streaked onto
several successive CM100 plates. After a monoculture was established, heavily sporulated plates
were flooded with sterile spore suspension medium (20% glycerol, 10% lactose), the spores were
suspended and aliquots were frozen at -70°C. Protein production was confirmed and followed by
western blot using anti-CBH I monoclonal antibodies and the Novex Western Breeze anti-mouse
chromogenic detection kit (Novex, San Diego, CA). Insertion of the gene was confirmed by
extracting genomic DNA using the YeaStar Genomic DNA Kit (Zymo Research, Orange, CA) and
carrying out PCR with Pfu Turbo DNA polymerase (Stratagene, La Jolla, CA) and cbh1 primers.

Production of Recombinant Enzyme. 

For enzyme production, spores were inoculated into 50 mL CM maltose medium, pH 5.0,
and grown at 32°C, 225 rpm in 250 mL baffled flasks. The cultures were transferred to 1.0 L of
CM maltose in 2,800 mL Fernbach flasks and grown under similar conditions. For large-scale
enzyme production (>1 mg), these cultures were transferred to 10 L of CM maltose in a New
Brunswick BioFlo 3000 chemostat (10-L working volume) maintained at 20% DO, pH 4.5, 25°C,
and 300 rpm. The culture was harvested by filtration through Miracloth after 2-3 days of growth.
For the 10-L fermentation broth, the filtrate was concentrated and diafiltered into 20 mM sodium
acetate, pH 5.0, by tangential flow ultrafiltration with an Amicon DC30 concentrator equipped
with a single 10,000 MWCO hollow fiber cartridge (1.1 mm I.D., 2.4 m² surface area). The
retentate from the 10-L concentration or the filtrate from smaller cultures was clarified in an
Amicon DC-2 concentrator by tangential flow filtration with two 0.1 µm hollow fiber cartridges
(1.1 mm I.D., 0.03 m² surface area, Millipore, Bedford, MA). The permeate was further
concentrated with an Amicon CH-2 concentrator equipped with three 10,000 MWCO hollow
fiber cartridges (1.1 mm I.D., 0.03 m² surface area). The final concentrate was sterile filtered
through a 0.45 µm filter and stored at 4°C until used.

The recombinant CBH I protein, SEQ. ID NO.: 4, was purified by passing the
concentrated culture broth over two or three CBinD 900 cartridge columns (Novagen, Madison,
WI) connected in series using a Pharmacia FPLC (Amersham Pharmacia Biotech, Inc.,
Piscataway, NJ), loading at 1.0 mL/min. The cartridges were equilibrated in 20 mM Bis-Tris, pH 6.5,
prior to loading and washed with the same buffer after loading. The bound rCBH I was eluted with
100% ethylene glycol (3 mL/column) by hand, using a syringe. The eluted rCBH I was
concentrated in an Amicon 10 mL stirred cell using a 25 mm PM10 membrane to <2.0 mL and
loaded onto a Pharmacia SuperDex 200 16/60 size-exclusion column. The mobile phase was 20
mM sodium acetate, 100 mM sodium chloride, and 0.02% sodium azide, pH 5.0, running at 1.0
mL/min. The eluted protein was concentrated by stirred cell and stored at 4°C. Concentration
was determined by A280 using the extinction coefficient and molecular weight calculated for
individual proteins by the ProtParam tool on the ExPASy website
(http://expasy.ch/tools/protparam.html).

Below are the formulations for the various media described herein:

Clutterbuck's Salts (20X)
    NaNO3                 120.0 g/L
    KCl                    10.4 g/L
    MgSO4·7H2O             10.4 g/L
    KH2PO4                 30.4 g/L

CM
    Yeast Extract           5 g/L
    Tryptone                5 g/L
    Glucose                10 g/L
    Clutterbuck's Salts    50 mL
    Add above to 900 mL dH2O, pH to 7.5, bring to 1000 mL

CM Agar - CM + 20 g/L Agar
CMK - CM Agar + 0.7 M KCl
CM100 - CM + 100 µg/mL Zeocin (Invitrogen, Carlsbad, CA)
CM170 - CM + 170 µg/mL Zeocin
KCl - 0.7 M KCl
KC - 0.7 M KCl + 50 mM CaCl2
KCM - 0.7 M KCl + 10 mM MOPS, pH 5.8
PCM - 40% PEG 8000, 50 mM CaCl2, 10 mM MOPS, pH 5.8
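As a quick check on the CM recipe, the final salt concentrations follow from diluting 50 mL of the 20X Clutterbuck's salts stock to 1000 mL. The sketch below is illustrative arithmetic only; the dictionary keys and variable names are chosen here for readability and are not taken from the source.

```python
# 20X Clutterbuck's salts stock, in g/L, as listed in the formulation above.
SALTS_20X_G_PER_L = {
    "sodium nitrate": 120.0,
    "KCl": 10.4,
    "MgSO4.7H2O": 10.4,
    "KH2PO4": 30.4,
}
STOCK_FOLD = 20   # the stock is labelled 20X
STOCK_ML = 50     # volume of stock added per batch of CM
FINAL_ML = 1000   # final volume of CM

dilution = FINAL_ML / STOCK_ML            # fold dilution of the stock
working_strength = STOCK_FOLD / dilution  # resulting strength in the medium

# Final concentration of each salt in CM, in g/L.
final_g_per_l = {salt: round(g / dilution, 2) for salt, g in SALTS_20X_G_PER_L.items()}

print(f"salts are at {working_strength:g}X in CM")
for salt, g in final_g_per_l.items():
    print(f"{salt}: {g} g/L")
```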



Example 2. Production of Reduced Glycosylation rCBH I; Sites N270A, N45A, and N384A

rCBH I/pPFE2 has been optimized using site-directed mutagenesis to achieve expression of
native molecular weight CBH I in A. awamori. The QuikChange SDM kit (Stratagene, San
Diego, CA) was used to make point mutations, switch amino acids, and delete or insert amino
acids in the native CBH I gene sequence. The QuikChange SDM technique was performed
using thermotolerant Pfu DNA polymerase, which replicates both plasmid strands with high
fidelity and without displacing the mutant oligonucleotide primers. The procedure used the
polymerase chain reaction (PCR) to modify the cloned CBH I DNA. The basic procedure used a
supercoiled double-stranded DNA (dsDNA) vector, with an insert of interest, and two synthetic
oligonucleotide primers containing a desired mutation. The oligonucleotide primers, each
complementary to opposite strands of the vector, extend during temperature cycling by means of
the polymerase. On incorporation of the primers, a mutated plasmid containing staggered nicks
was generated. Following temperature cycling, the product was treated with the DpnI restriction
enzyme. DpnI is specific for methylated and hemi-methylated DNA and thus digests the
unmutated parental DNA template, selecting for the mutation-containing, newly synthesized
DNA. The nicked vector DNA, containing the desired mutations, was then transformed into E.
coli. The small amount of template DNA required to perform this reaction and the high fidelity
of the Pfu DNA polymerase contribute to the high mutation efficiency and minimize the potential
for the introduction of random mutations. Three glycosylation-site amino acids on the protein
surface were targeted for substitution of an alanine (A) residue in place of asparagine (N). Single
site substitutions were successfully completed in the CBH I coding sequence at sites N45, N270,
and N384 of SEQ. ID NO.: 4 by site-directed mutagenesis, and confirmed by DNA sequencing.
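One way to see what each mutagenic primer encodes is to translate the sense-strand primers (listed under "Mutagenic Primers Used in Site-directed Mutagenesis PCR") in frame. The sketch below is illustrative rather than part of the disclosed procedure: the reading-frame offsets and the residue number assigned to the first translated codon are inferred here by aligning the translations with SEQ ID NO: 4, and the function name is hypothetical.

```python
from itertools import product

# Standard genetic code, built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# name: (sense primer from the text, frame offset, residue number of first codon).
# The offset and residue number are inferred assumptions, not stated in the text.
SENSE_PRIMERS = {
    "N45A": ("GGACTCACGCTACGGCCAGCAGCACGAACTGC", 2, 41),
    "N270A": ("CCCATACCGCCTGGGCGCCACCAGCTTCTACGGCCC", 1, 265),
    "N384A": ("GGACTCCACCTACCCGACAGCCGAGACCTCCTCCACACCCG", 1, 378),
}


def translate(dna, offset):
    """Translate complete codons of `dna`, starting at `offset`."""
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    return "".join(CODONS[c] for c in codons)


for name, (primer, offset, first_residue) in SENSE_PRIMERS.items():
    peptide = translate(primer, offset)
    site = int(name[1:-1])  # e.g. "N45A" -> 45
    print(name, peptide, "residue", site, "=", peptide[site - first_residue])
```

Under these assumptions, each primer places an alanine codon at the targeted position while leaving the flanking residues as in SEQ ID NO: 4.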
Table 1.

CONSTRUCT            HOST          MW         Km      Vmax
T. reesei            none          57.8 kDa   1.94    0.746
rCBH I wt cDNA       A. awamori    63.3 kDa   2.14    0.668
rCBH I wt genomic    A. awamori    63.3 kDa   -       -
rCBHIN270A           A. awamori    61.7 kDa   2.25    0.489

As shown in Table 1, Western blot analysis of the supernatant obtained from a culture of the
single glycosylation-site mutant CBHIN270A (SEQ. ID NO.: 2) expressed in A. awamori
demonstrated a decrease in the molecular weight of the protein (61.7 kDa) as compared
to that of the wild-type cDNA product (63.3 kDa) and the wild-type
genomic DNA product (63.3 kDa). These results demonstrate a reduction in the level of glycosylation in
the reduced glycosylation mutant CBHIN270A, via expression in A. awamori. It is also shown, in
the Table, that the CBHIN270A enzyme nearly retained its native enzymatic activity when
assayed using the pNPL substrate. While not shown in the Table, variants CBHIN45A (SEQ. ID
NO.: 1) and CBHIN384A (SEQ. ID NO.: 3) have also demonstrated a reduction in the amount of
glycosylation and native activity when expressed from the heterologous host A. awamori.
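The molecular weights in Table 1 can be compared directly. The short sketch below computes the apparent excess mass over native CBH I for each recombinant form. It is illustrative arithmetic only: the values are apparent molecular weights from Western blots, and attributing the difference entirely to glycosylation is the interpretation given in the text, not something the calculation establishes.

```python
# Apparent molecular weights from Table 1, in kDa.
NATIVE_KDA = 57.8
RECOMBINANT_KDA = {
    "rCBH I wt cDNA": 63.3,
    "rCBH I wt genomic": 63.3,
    "rCBHIN270A": 61.7,
}

# Excess apparent mass of each A. awamori product over native T. reesei CBH I.
excess_kda = {name: round(mw - NATIVE_KDA, 1) for name, mw in RECOMBINANT_KDA.items()}

# Mass removed by the single N270A substitution relative to the wild-type cDNA product.
reduction_kda = round(RECOMBINANT_KDA["rCBH I wt cDNA"] - RECOMBINANT_KDA["rCBHIN270A"], 1)

for name, extra in excess_kda.items():
    print(f"{name}: +{extra} kDa over native")
print(f"N270A lowers the apparent mass by {reduction_kda} kDa")
```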

Example 3. Production of Reduced Glycosylation rCBH I: Double and Triple Mutants

Double and triple combinations of this substitution have also been completed in the CBH I
coding sequence (SEQ. ID NO.: 4) at sites N45, N270, and N384 by site-directed mutagenesis
and confirmed by DNA sequencing. These double and triple-site constructs will also yield rCBH I
enzymes with reduced glycosylation and, presumably, native activity.
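The set of double and triple constructs implied by three sites can be enumerated mechanically. The sketch below simply lists them; the slash-separated naming is a convention chosen here for illustration and is not taken from the source.

```python
from itertools import combinations

# The three single substitutions described in Example 2.
SITES = ("N45A", "N270A", "N384A")


def variants(k):
    """Return every combination of k substitutions as a slash-separated name."""
    return ["/".join(combo) for combo in combinations(SITES, k)]


doubles = variants(2)
triples = variants(3)
print("double mutants:", doubles)
print("triple mutant:", triples)
```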

Mutagenic Primers Used in Site-directed Mutagenesis PCR

NotI, XbaI insertion for vector construction
Mutagenic primers
C-terminal strand (XbaI): AGAGAGTCTAGACACGGAGCTTACAGGC
N-terminal strand (NotI): AAAGAAGCGCGGCCGCGCCTGCACTCTCCAATCGG

Repair of NotI site to native sequence
Mutagenic primers
sense strand: GGCAAATGTGATTTCCAAGCGCCAGTCGGCCTGCACTCTCC
antisense strand: GGAGAGTGCAGGCCGACTGGCGCTTGGAAATCACATTTGCC

N45A glycosylation site mutation
Mutagenic primers
sense strand: GGACTCACGCTACGGCCAGCAGCACGAACTGC
antisense strand: GCAGTTCGTGCTGCTGGCCGTAGCGTGAGTCC

N270A glycosylation site mutation
Mutagenic primers
sense strand: CCCATACCGCCTGGGCGCCACCAGCTTCTACGGCCC
antisense strand: GGGCCGTAGAAGCTGGTGGCGCCCAGGCGGTATGGG

N384A glycosylation site mutation
Mutagenic primers
sense strand: GGACTCCACCTACCCGACAGCCGAGACCTCCTCCACACCCG
antisense strand: CGGGTGTGGAGGAGGTCTCGGCTGTCGGGTAGGTGGAGTCC
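Because each sense/antisense pair in this type of mutagenesis anneals to opposite strands at the same site, the two primers of a pair are expected to be exact reverse complements. The sketch below checks that for the four pairs listed above; it is a consistency check written for illustration, with a hypothetical helper name, and is not part of the disclosed procedure.

```python
# Translation table mapping each base to its Watson-Crick complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


# (sense, antisense) primer pairs as listed in the text.
PRIMER_PAIRS = {
    "NotI repair": ("GGCAAATGTGATTTCCAAGCGCCAGTCGGCCTGCACTCTCC",
                    "GGAGAGTGCAGGCCGACTGGCGCTTGGAAATCACATTTGCC"),
    "N45A": ("GGACTCACGCTACGGCCAGCAGCACGAACTGC",
             "GCAGTTCGTGCTGCTGGCCGTAGCGTGAGTCC"),
    "N270A": ("CCCATACCGCCTGGGCGCCACCAGCTTCTACGGCCC",
              "GGGCCGTAGAAGCTGGTGGCGCCCAGGCGGTATGGG"),
    "N384A": ("GGACTCCACCTACCCGACAGCCGAGACCTCCTCCACACCCG",
              "CGGGTGTGGAGGAGGTCTCGGCTGTCGGGTAGGTGGAGTCC"),
}

for name, (sense, antisense) in PRIMER_PAIRS.items():
    ok = reverse_complement(sense) == antisense
    print(f"{name}: {len(sense)} nt, antisense matches reverse complement: {ok}")
```

The XbaI and NotI insertion primers are omitted because they anneal at opposite ends of the coding sequence and are not a complementary pair.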

The foregoing description is considered as illustrative only of the principles of the
invention. Furthermore, since numerous modifications and changes will readily occur to those
skilled in the art, it is not desired to limit the invention to the exact construction and process
shown as described above. Accordingly, all suitable modifications and equivalents may be
resorted to falling within the scope of the invention as defined by the claims which follow.
Claims

1. A method for making an active exoglucanase in a eukaryotic heterologous host,
the method comprising reducing glycosylation of the exoglucanase, wherein reducing comprises
replacing an N-glycosylation site amino acid residue with a non-glycosyl accepting amino acid
residue.

2. The method of claim 1, wherein the N-glycosylation site amino acid residues
include asparagine 45, 270, or 384 of SEQ ID NO: 4 and the non-glycosyl accepting amino acid
residue includes alanine.

3. The method of claim 1 wherein replacing comprises site-directed mutagenesis.

4. The method of claim 1 wherein the exoglucanase comprises a cellobiohydrolase.

5. An exoglucanase, comprising SEQ. ID. NO: 1.

6. An exoglucanase, comprising SEQ. ID. NO: 2.

7. An exoglucanase, comprising SEQ. ID. NO: 3.

8. An exoglucanase, comprising a combination of claims 5, 6, or 7.
SEQUENCE LISTING

<110> Adney, William S.
      Decker, Stephen R.
      Lantz-McCarter, Suzanne
      Baker, John O.
      Vinzant, Todd B.
      Nieves, Rafael A.
      Himmel, Michael E.

<120> CELLOBIOHYDROLASE REDUCED GLYCOSYLATION VARIANTS CBHIN45A;
      CBHIN270A, AND CBHIN384A

<130> HIMMEL NREL IR# 99-45

<140>
<141>

<160> 4

<170> PatentIn Ver. 2.0

<210> 1
<211> 496
<212> PRT
<213> Trichoderma reesei

<400> 1

Gln Ser Ala Cys Thr Leu Gln Ser Glu Thr His Pro Pro Leu Thr Trp
1               5                   10                  15

Gln Lys Cys Ser Ser Gly Gly Thr Cys Thr Gln Gln Thr Gly Ser Val
            20                  25                  30

Val Ile Asp Ala Asn Trp Arg Trp Thr His Ala Thr Ala Ser Ser Thr
        35                  40                  45

Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp Asn
    50                  55                  60

Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala Ser
65                  70                  75                  80

Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser Ile Gly Phe Val
                85                  90                  95

Thr Gln Ser Ala Gln Lys Asn Val Gly Ala Arg Leu Tyr Leu Met Ala
            100                 105                 110

Ser Asp Thr Thr Tyr Gln Glu Phe Thr Leu Leu Gly Asn Glu Phe Ser
        115                 120                 125

Phe Asp Val Asp Val Ser Gln Leu Pro Cys Gly Leu Asn Gly Ala Leu
    130                 135                 140

Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Thr
145                 150                 155                 160

Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys
                165                 170                 175

Pro Arg Asp Leu Lys Phe Ile Asn Gly Gln Ala Asn Val Glu Gly Trp
            180                 185                 190

Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly Ile Gly Gly His Gly Ser
        195                 200                 205

Cys Cys Ser Glu Met Asp Ile Trp Glu Ala Asn Ser Ile Ser Glu Ala
    210                 215                 220

Leu Thr Pro His Pro Cys Thr Thr Val Gly Gln Glu Ile Cys Glu Gly
225                 230                 235                 240

Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr Cys
                245                 250                 255

Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr Ser
            260                 265                 270

Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys Leu
        275                 280                 285

Thr Val Val Thr Gln Phe Glu Thr Ser Gly Ala Ile Asn Arg Tyr Tyr
    290                 295                 300

Val Gln Asn Gly Val Thr Phe Gln Gln Pro Asn Ala Glu Leu Gly Ser
305                 310                 315                 320

Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu Ala
                325                 330                 335

Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gln Phe
            340                 345                 350

Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp Asp
        355                 360                 365

Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr Asn
    370                 375                 380

Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr Ser
385                 390                 395                 400

Ser Gly Val Pro Ala Gln Val Glu Ser Gln Ser Pro Asn Ala Lys Val
                405                 410                 415

Thr Phe Ser Asn Ile Lys Phe Gly Pro Ile Gly Ser Thr Gly Asn Pro
            420                 425                 430

Ser Gly Gly Asn Pro Pro Gly Gly Asn Arg Gly Thr Thr Thr Thr Arg
        435                 440                 445

Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gln Ser His
    450                 455                 460

Tyr Gly Gln Cys Gly Gly Ile Gly Tyr Ser Gly Pro Thr Val Cys Ala
465                 470                 475                 480

Ser Gly Thr Thr Cys Gln Val Leu Asn Pro Tyr Tyr Ser Gln Cys Leu
                485                 490                 495



<210> 2
<211> 496
<212> PRT
<213> Trichoderma reesei

<400> 2

Gln Ser Ala Cys Thr Leu Gln Ser Glu Thr His Pro Pro Leu Thr Trp
1               5                   10                  15

Gln Lys Cys Ser Ser Gly Gly Thr Cys Thr Gln Gln Thr Gly Ser Val
            20                  25                  30

Val Ile Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser Thr
        35                  40                  45

Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp Asn
    50                  55                  60

Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala Ser
65                  70                  75                  80

Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser Ile Gly Phe Val
                85                  90                  95

Thr Gln Ser Ala Gln Lys Asn Val Gly Ala Arg Leu Tyr Leu Met Ala
            100                 105                 110

Ser Asp Thr Thr Tyr Gln Glu Phe Thr Leu Leu Gly Asn Glu Phe Ser
        115                 120                 125

Phe Asp Val Asp Val Ser Gln Leu Pro Cys Gly Leu Asn Gly Ala Leu
    130                 135                 140

Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Thr
145                 150                 155                 160

Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys
                165                 170                 175

Pro Arg Asp Leu Lys Phe Ile Asn Gly Gln Ala Asn Val Glu Gly Trp
            180                 185                 190

Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly Ile Gly Gly His Gly Ser
        195                 200                 205

Cys Cys Ser Glu Met Asp Ile Trp Glu Ala Asn Ser Ile Ser Glu Ala
    210                 215                 220

Leu Thr Pro His Pro Cys Thr Thr Val Gly Gln Glu Ile Cys Glu Gly
225                 230                 235                 240

Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr Cys
                245                 250                 255

Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Ala Thr Ser
            260                 265                 270

Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys Leu
        275                 280                 285

Thr Val Val Thr Gln Phe Glu Thr Ser Gly Ala Ile Asn Arg Tyr Tyr
    290                 295                 300

Val Gln Asn Gly Val Thr Phe Gln Gln Pro Asn Ala Glu Leu Gly Ser
305                 310                 315                 320

Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu Ala
                325                 330                 335

Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gln Phe
            340                 345                 350

Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp Asp
        355                 360                 365

Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr Asn
    370                 375                 380

Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr Ser
385                 390                 395                 400

Ser Gly Val Pro Ala Gln Val Glu Ser Gln Ser Pro Asn Ala Lys Val
                405                 410                 415

Thr Phe Ser Asn Ile Lys Phe Gly Pro Ile Gly Ser Thr Gly Asn Pro
            420                 425                 430

Ser Gly Gly Asn Pro Pro Gly Gly Asn Arg Gly Thr Thr Thr Thr Arg
        435                 440                 445

Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gln Ser His
    450                 455                 460

Tyr Gly Gln Cys Gly Gly Ile Gly Tyr Ser Gly Pro Thr Val Cys Ala
465                 470                 475                 480

Ser Gly Thr Thr Cys Gln Val Leu Asn Pro Tyr Tyr Ser Gln Cys Leu
                485                 490                 495



<210> 3
<211> 496
<212> PRT
<213> Trichoderma reesei

<400> 3

Gln Ser Ala Cys Thr Leu Gln Ser Glu Thr His Pro Pro Leu Thr Trp
1               5                   10                  15

Gln Lys Cys Ser Ser Gly Gly Thr Cys Thr Gln Gln Thr Gly Ser Val
            20                  25                  30

Val Ile Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser Thr
        35                  40                  45

Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp Asn
    50                  55                  60

Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala Ser
65                  70                  75                  80

Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser Ile Gly Phe Val
                85                  90                  95

Thr Gln Ser Ala Gln Lys Asn Val Gly Ala Arg Leu Tyr Leu Met Ala
            100                 105                 110

Ser Asp Thr Thr Tyr Gln Glu Phe Thr Leu Leu Gly Asn Glu Phe Ser
        115                 120                 125

Phe Asp Val Asp Val Ser Gln Leu Pro Cys Gly Leu Asn Gly Ala Leu
    130                 135                 140

Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Thr
145                 150                 155                 160

Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys
                165                 170                 175

Pro Arg Asp Leu Lys Phe Ile Asn Gly Gln Ala Asn Val Glu Gly Trp
            180                 185                 190

Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly Ile Gly Gly His Gly Ser
        195                 200                 205

Cys Cys Ser Glu Met Asp Ile Trp Glu Ala Asn Ser Ile Ser Glu Ala
    210                 215                 220

Leu Thr Pro His Pro Cys Thr Thr Val Gly Gln Glu Ile Cys Glu Gly
225                 230                 235                 240

Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr Cys
                245                 250                 255

Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr Ser
            260                 265                 270

Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys Leu
        275                 280                 285

Thr Val Val Thr Gln Phe Glu Thr Ser Gly Ala Ile Asn Arg Tyr Tyr
    290                 295                 300

Val Gln Asn Gly Val Thr Phe Gln Gln Pro Asn Ala Glu Leu Gly Ser
305                 310                 315                 320

Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu Ala
                325                 330                 335

Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gln Phe
            340                 345                 350

Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp Asp
        355                 360                 365

Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr Ala
    370                 375                 380

Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr Ser
385                 390                 395                 400

Ser Gly Val Pro Ala Gln Val Glu Ser Gln Ser Pro Asn Ala Lys Val
                405                 410                 415

Thr Phe Ser Asn Ile Lys Phe Gly Pro Ile Gly Ser Thr Gly Asn Pro
            420                 425                 430

Ser Gly Gly Asn Pro Pro Gly Gly Asn Arg Gly Thr Thr Thr Thr Arg
        435                 440                 445

Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gln Ser His
    450                 455                 460

Tyr Gly Gln Cys Gly Gly Ile Gly Tyr Ser Gly Pro Thr Val Cys Ala
465                 470                 475                 480

Ser Gly Thr Thr Cys Gln Val Leu Asn Pro Tyr Tyr Ser Gln Cys Leu
                485                 490                 495



<210> 4
<211> 496
<212> PRT
<213> Trichoderma reesei

<400> 4

Gln Ser Ala Cys Thr Leu Gln Ser Glu Thr His Pro Pro Leu Thr Trp
1               5                   10                  15

Gln Lys Cys Ser Ser Gly Gly Thr Cys Thr Gln Gln Thr Gly Ser Val
            20                  25                  30

Val Ile Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser Thr
        35                  40                  45

Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp Asn
    50                  55                  60

Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala Ser
65                  70                  75                  80

Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser Ile Gly Phe Val
                85                  90                  95

Thr Gln Ser Ala Gln Lys Asn Val Gly Ala Arg Leu Tyr Leu Met Ala
            100                 105                 110

Ser Asp Thr Thr Tyr Gln Glu Phe Thr Leu Leu Gly Asn Glu Phe Ser
        115                 120                 125

Phe Asp Val Asp Val Ser Gln Leu Pro Cys Gly Leu Asn Gly Ala Leu
    130                 135                 140

Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Thr
145                 150                 155                 160

Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys
                165                 170                 175

Pro Arg Asp Leu Lys Phe Ile Asn Gly Gln Ala Asn Val Glu Gly Trp
            180                 185                 190

Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly Ile Gly Gly His Gly Ser
        195                 200                 205

Cys Cys Ser Glu Met Asp Ile Trp Glu Ala Asn Ser Ile Ser Glu Ala
    210                 215                 220

Leu Thr Pro His Pro Cys Thr Thr Val Gly Gln Glu Ile Cys Glu Gly
225                 230                 235                 240

Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr Cys
                245                 250                 255

Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr Ser
            260                 265                 270

Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys Leu
        275                 280                 285

Thr Val Val Thr Gln Phe Glu Thr Ser Gly Ala Ile Asn Arg Tyr Tyr
    290                 295                 300

Val Gln Asn Gly Val Thr Phe Gln Gln Pro Asn Ala Glu Leu Gly Ser
305                 310                 315                 320

Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu Ala
                325                 330                 335

Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gln Phe
            340                 345                 350

Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp Asp
        355                 360                 365

Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr Asn
    370                 375                 380

Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr Ser
385                 390                 395                 400

Ser Gly Val Pro Ala Gln Val Glu Ser Gln Ser Pro Asn Ala Lys Val
                405                 410                 415

Thr Phe Ser Asn Ile Lys Phe Gly Pro Ile Gly Ser Thr Gly Asn Pro
            420                 425                 430

Ser Gly Gly Asn Pro Pro Gly Gly Asn Arg Gly Thr Thr Thr Thr Arg
        435                 440                 445

Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gln Ser His
    450                 455                 460

Tyr Gly Gln Cys Gly Gly Ile Gly Tyr Ser Gly Pro Thr Val Cys Ala
465                 470                 475                 480

Ser Gly Thr Thr Cys Gln Val Leu Asn Pro Tyr Tyr Ser Gln Cys Leu
                485                 490                 495